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The seceder model illustrates how the desire to be different than the average can lead to formation of groups in 
a population. We turn the original, agent based, seceder model into a model of network evolution. We find that 
the structural characteristics our model closely matches empirical social networks. Statistics for the dynamics of 
group formation are also given. Extensions of the model to networks of companies are also discussed. 
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[. INTRODUCTION 

Social networks have "community structure" — actors (ver- 
tices) with the same interests, profession, age (and so on), 
organize int o tig htly connected subnetworks, or communi- 
ties. i2 It l22l |241) Subnetworks are connected into larger con- 
glomerates into a hierarchical structure of larger and more 
loosely connected structures. Over the last few years the 
issue of communities in social networks has ventured be- 
yond sociology into the area of physicists' network stud- 
ies B fT7ll4ll) . The problem how to detect and quantify com- 
munity structure in networks has been the topic of a number 
papers i2 It l43l [46), whereas a few other have been models 
of networks with community structure I27il35il4^) . In these 
models, the common properties defining the community are 
external to the network evolution (in the sense that an individ- 
ual does not choose the community to belong to by virtue of 
his or her position in the network). In this paper we present 
a model where the community structure emerge as an effect 
of the agents personal rationales. We do this by constructing 
a networked version of an agent based model — the seceder 
model O O E H3)— of social group formation based on 
the assumption that people actively tries to be different than 
the average. Independence and the desire to be different plays 
an important role in social group formation (123) . this might be 
even more important in the social networking of adolescents. 
The important observation is that few wants to be different 
than anyone else, rather one tries to affiliate to non-central 
group. This type of mechanisms are probably rather ubiqui- 
tous, so the connotations of eccentricity are not intended for 
the name of the model. (See Ref. (51) for a non-scientific 
account of the formation of youth sub-cultures by these and 
similar premises.) 

Another system where the networked seceder model can 
serve as a model — or at least a direction for extension of 
present models (see e.g. Ref. (33))) — is networks where the 
vertices are companies and the edges indicate a similar niche. 
(Such edges can be defined indirectly using stock-price corre- 
lations (121).) The establishment of new companies are natu- 
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rally more frequent in new markets. Assuming new markets 
are remote to more traditional markets, the networked seceder 
model makes a good model of a such company networks. 



II. PRELIMINARIES 
A. Notations 

The model we present produces a sequence of graphs {G,}. 
Each member of this sequence consists of the same set V of 
N vertices, and a time specific set of M undirected edges E,. 
The model defines a Markov process and is thus suitable for 
a Monte Carlo simulation. The number of iterations of the 
algorithm defines the simulation time t — 1, ■ • • , ? max . 

We let d(i, j) denote the distance (number of edges in the 
shortest path) between two vertices i and j. We will also need 
the eccentricity defined as the maximal distance from i to any 
other vertex. 



B. The seceder model 

The original seceder model l fl6l) is based on N individuals 
with a real number s(i) representing the traits (or personality) 
of individual i. The algorithm is then to repeat the following 
steps: 

1. Select three individuals z'i, z'2 and z'3 with uniform ran- 
domness. 

2. Pick the one (we call it i) of these whose s- value is far- 
thest away from the average [s(i\) + sfo) + s(ii,)]/3. 

3. Replace the s-value of a uniformly randomly chosen 
agent with s(i)+T], where 77 is a random number from the 
normal distribution with mean zero and variance one. 

Note that the actual values of s is irrelevant, only the differ- 
ences between s of different agents. The output of the seceder 
model is a complex pattern of individuals that stick together 
in well-defined groups. The groups has a life-cycle of their 
own — they are born, spawn new groups and die. Statistical 
properties of the model is investigated in Ref. ( 16), effects of 
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FIG. 1 Illustration of the networked seceder model, (a) In step^ 
three vertices, i\, i 2 and i 3 , are chosen at random, (b) In step [2] the 
least central of the three vertices is relabeled to i. In step|5|a vertex 
j is selected at random and (c) the edges of j are rewired to i and 
is neighborhood (and to a set of random other vertices if necessary. 
Note that, in (c), j is moved to the cluster it is rewired to. In step[3]/s 
edges are rewired with a probability p. The shaded areas represent 
tightly connected subgraphs. 



a bounded trait-space is studied in Ref. ( 14), the fitness land- 
scape is the issue of Ref. (15), and Ref. ( 50) presents a gener- 
alization to higher-dimensional trait-spaces. 

Our generalization of this model to a network model based 
on the idea that if the system is embedded in a network, then 
the difference in personality is implicitly expressed through 
the network position, so the identity number (or vector) s be- 
comes superfluous. I.e., the homophily assumption (34) — that 
like attracts like — means that the difference in character be- 
tween two vertices i and j (defined as \s(i) - s(j)\ in the tradi- 
tional seceder model) can be estimated by the graph distance 
d(i, j) in a networked model. The model we propose is then, 
starting from any graph with N vertices and M edges, to iter- 
ate the following steps: 

1. Select three different vertices z'i, ^ and z'3 with uniform 
randomness. 

2. Pick the one i of these that is least central in the follow- 
ing sense: If the graph is connected vertices of highest 
eccentricity are the least central. If the graph is discon- 



nected the most eccentric vertices within the smallest 
connected subgraph are the least central. If more than 
one vertex is least central, let i be a vertex in the set of 
least central vertices chosen uniformly randomly. 

3. Choose a vertex j by uniform randomness. If degy < 
deg/ + 1, rewire j's edges to i and a random selection 
of z's neighbors. If deg j > deg i + 1, rewire j's edges to 
i, i's neighborhood and (if deg j > deg i + 1) to deg j - 
deg / - 1 randomly selected other vertices. 

4. Go through j's edges once more and rewire these with 
a probability p to a randomly chosen vertex. 

The rewiring of steps [5] and |4] are performed with the restric- 
tion that no multiple edges or loops (edges that goes from a 
vertex to itself) are allowed. Steps [2 to [3] correspond rather 
closely to the same steps of the original model. That j's edges 
are rewired mainly to the neighborhood of i (and / itself) re- 
flects the inheritance of trait value of the original model — by 
the homophily assumption the neighborhood of i will have 
much the same personality as / itself. The main difference 
between original and the networked seceder model is step 
0] where some vertices are rewired to distant vertices. The 
motivation for this step is that long-range connections exists 
in real-world networks ( 15 2i 1531) . and can in some situations 
be even more important than the strong links of a cohesive 
group (23). This kind of rewiring, to obtain long-range con- 
nections has been used to model "small-world behavior" of 
networks J53I) (i.e. a logarithmic, or slower, scaling of the av- 
erage inter-vertex distance for ensembles of graphs with the 
same average degree mw ). 

To make the model consistent we also have to specify the 
initial graph. As far as we can see, at least for finite p, this 
choice is irrelevant — the structure of the generated graphs are 
the same (or at least very similar). We will not investigate 
this point further. Instead we fix the initial graph to an instant 
of Erdos and Renyi's random graph model tVA) (for a modern 
survey of this model, see Ref. $26t) ): A graph with N edges and 
M edges is constructed by starting from isolated vertices and 
then iteratively introduce edges between vertex-pairs chosen 
by uniform randomness and with the restriction that no multi- 
ple edges or loops are allowed. To be sure that the structure of 
the random graph is gone we run the construction algorithm 
ION sweeps through every vertex before the graph is sampled. 
(We justify this number a posteriori below.) 

An illustration of the construction algorithm can be seen in 
Fig-E An realization of the algorithm is displayed in Fig. [5] 
The p-value of this realization is zero. For the value p = 0. 1 
we use in most simulations the community structure is less 
visible to the eye. Nevertheless — as we will see — the com- 
munity structure is still substantial for much larger values of 
P- 



C. Detecting communities 

To analyze the structure of cohesive subgroups in our model 
networks we use the community detection scheme presented 
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FIG. 2 One realization of the networked seceder model. The model parameters are N = 50, M - 150 and p = 0. The indicated groups are 
identified with Newman's clustering algorithm (see Sect. lll.Ll . This realization have modularity Q = 0.575, clustering coefficient C = 0.530, 
and assortative mixing coefficient r = 0.0456. 



m Ref. (HI). This algorithm starts from one-vertex clusters 
and (somewhat reminiscent of the algorithm in Ref. (10)) iter- 
atively merges clusters to form clusters of increasing size with 
relatively few edges to the outside. The crucial ingredient in 
the scheme is a quality function 

Q' = Yj( e ss-<Z) (1) 

seS 

where S is the set of subnetworks at a specific iteration of the 
algorithm and e ss > is the fraction of edges that goes between a 
vertex in s and a vertex in s', and a s = 2 S , e ss >. The algorithm 
performs a steepest-accent in Q -space — at each iteration the 
two clusters that leads to the largest increase (or smallest de- 
crease) in Q' are merged. The iteration having the highest Q' 
value — which defines the modularity Q — gives the partition 
into subgroups. 

D. Conditional uniform graph tests 

One can argue that some network structures are more ba- 
sic than other. Given such an assumption and a network G, 
an interesting issue is whether a certain structure, say X, is 
an artifact of a more basic structure, say Y. One way to 
do this is by a conditional uniform graph test: One com- 
pares the value of X(G) with X averaged over an ensemble 
of graphs with a the value of Y fixed to Y(G). This has (since 
Ref. (1291) ') been a well established technique in social network 
analysis and has recently been brought over to physicists' (1321) 
and biologists ' (© network literature. A common assump- 
tion J32I E7L 0) is that the degree distribution is such a very 
basic structure. We make this assumption too and perform a 
conditional uniform graph test with respect to the degree se- 
quence of the networks. To sample networks with a given de- 
gree sequence we use the idea of Ref. ( l47i) to rewire the edges 
of the network in such a way that the degree sequence remains 
unaltered. More precisely we go through all edges (z, f) e E 
and perform the following: 

1. Construct the set E' of edges such that if € E' 
then replacing (2, f) and (z, j) by (z, j) and (i, j) would 
not introduce any loops (self-edges) or multiple edges. 



2. Pick an edge (z, j) e E' by uniform randomness. 

3. Rewire (z, j) to (z, )) and (z, j) to (z, j). 

For every realization of the seceder algorithm we sample 
"sample = 10 randomized reference networks as described 
above. The motivation for this rather low number is that 
all quantities seems to be self-averaging (the fluctuations de- 
crease with AO and many have symmetric distributions with 
respect to rewirings (which makes many realization averages 
compensate for few rewiring averages). To further motivate 
this small « sa mpie we compare with « sam pie = 100 for the small- 
est size (N = 200, which, as mentioned, is most affected by 
fluctuations) and find that the quantities typically differ by 
0.5% which we consider small. 



III. THE COMMUNITY STRUCTURE OF THE SECEDER 
MODEL 

The key quantity capturing the degree of community order 
in the network is the modularity Q (defined in Sect. 111. Cl ). In 
Fig-EJa) we see that, if the average degree and p is kept con- 
stant Q converges to a high value, Q « 0.64 for p - 0.1 and 
M = 3N. This value is much higher than the reference value 
from the randomized networks — this curve has a peak around 
N = 1500 and decays for larger N, larger sizes would be 
needed to see of Q converges to a finite value for the random- 
ized networks. With the analogy to the Watts-Strogatz model 
(where a fraction p of a circulant's (13) edges is rewired ran- 
domly) we would say that p — 0. 1 is a rather high value, still Q 
is much higher for the networked seceder model than for ran- 
dom networks with the same degree distribution. From this 
we conclude that our model fulfills its purpose — it produces 
networks with a pronounced community structure just as the 
original seceder model makes agents divide into well-defined 
groups in trait-space. In Fig. |2Jb) we plot the M-dependence 
of Q for fixed N = 600 and p = 0.1. We see that Q decreases 
with M for both the seceder model and the randomized net- 
works. As M approaches its maximum value AMW - l)/2 
the curves will converge (since the fully connected graph is 
unique), but the figure shows that the curves are separated for 
a wide parameter range. More importantly it suggests that the 
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FIG. 3 The modularity Q as a function of the model parameters, (a) 
shows Q as a function of N with M = 32V and p = 0. 1 . (b) displays 
Q for different M for N = 600 and p = 0.1. In (c) we plot the p 
dependence of Q for N = 200 and M = 600. The gray line in (a) is a 
fit to a exponential. All errorbars are smaller than the symbol size. 
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FIG. 4 The number of groups b as a function of the systems size 
N. The other parameter values are M = 3N and p = 0.1. The line 
is a fit to a power-law ah®. For this set of parameters /3 = 0.400(6) 
for the seceder model and 0.193(6) for the reference-graphs of the 
conditional uniform graph test. All errorbars are smaller than the 
symbol size. Note the double-logarithmic scale. 



quantity Q should be rescaled by some appropriate function 
if networks of different average degree are to be compared. 
In the rest of our paper, however, we will keep the degree 
constant. In Fig. 0c) we show the p-dependence of Q. As ex- 
pected Q decays monotonously, in fact almost linearly, with p. 
The curves for the seceder model converges to the curve of the 
randomized networks as p — » 0. Q of the randomized refer- 
ence networks is almost /^-independent. The fact that it is not 
completely p-independent means that the degree distribution 
of the seceder model must vary with p. We will strengthen 
this claim later. 

Fig. 0] shows the size-dependence of b — the number of 
groups. We see that this function can be well-described by 
a constant plus a power-law, 



A + 1 



(2) 



(where A is a constant) with an exponent B - 0.400(6) for the 
seceder model and B = 0.193(6) for the random networks with 
the same degree distribution. The average community-size 
is given by N/b and will therefore also behave as a power- 
law, with exponent 1 - B = 0.600(6). This fact that for the 
number and average size of the communities grows with N 
does not seem contradictory to the real world to us. Since 
a community, both in a social and economical interpretation 
of the model, does not need to be controlled or supervised 
there is no natural upper limit to the number of community 
members. Furthermore, there is no particular constraint on 
the number of communities present in real world systems. A 
thorough study of the scaling-exponents would be interesting, 
but falls out of the scope of the present paper. 

In Fig.|5]we display the average geodesic lengths within a 
community /i ntra and between vertices of different communi- 
ties Zi n ter for parameter values M = 3N and p = 0.1. To be 
precise, we consider the largest connected component (which 
typically contains 99% of the vertices), and define 
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is the number of pairs of vertices belonging to the same com- 
munity. As seen in Fig.|5ja) and (b) both /j n t m and Zi nter grows 
logarithmically as functions of with the same slope in a 
semi-logarithmic plot. A logarithmic scaling of the average 
shortest path length (which of course also holds) is expected 
(cf. Ref. I ll II) ). But we could not anticipate the lack of qualita- 
tive difference between distances between vertices of the same 
an different clusters. The actual values of /; ntra is significantly 
smaller than /j nt er and this difference holds as — > 00: As seen 
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FIG. 6 Degree distribution of the networked seceder model. The 
model parameters are N = 1800, M = 5400 and p = 0.1. The 
squares indicate the degree distribution of a random graph with the 
sizes (N and M), i.e., the initial network before the iterations of the 
seceder model commence — see Eq. ^5}. 



in Fig.|3c) /inter - 'intra converges to 0.60(1). The same value 
for the randomized graphs is Zi nt er - 'intra = 0.204(8) which 
is expected — the detected communities in the networked se- 
ceder model are more well-defined and tight-knit that the cor- 
responding communities in a random network with the same 
degree distribution. 



A. Degree distribution 

Following the works of Barabasi and coworkers (H|;|5|fl the 
degree distribution has been perhaps the most studied network 
structure. Many of these studies have found a skewed, power- 
law tailed, degree-distribution. In some social networks — of 
telephone calls ( 1 ), e-mail communication <fl8h and the net- 
work of sexual contacts (13 ll) — authors have found large tails 
of the degree distribution that fits well to a power-law func- 
tional form. Other social network studies report degree dis- 
tributions with large degree cut-offs, these contain network 
of movie actor |4), scientific collaborations (37) or Internet 
community interaction J25I) or romantic interaction among 
High School students (the network of Ref. (8) as studied in 
Ref. i4]\t ). Yet other studies have found social networks with 
Gaussian degree distributions (the acquaintance networks of 
Refs. J20I) and (0) studied in Ref. ([^), or exponential degree 
distributions (of e-mail networks (1241 1421) ~). We conclude that 
the degree distribution of social networks is still an open ques- 
tion with, most likely, not a single solution — different social 
networks may follow different degree distributions. The de- 
gree distribution of the networked seceder model is displayed 
in Fig. [6] We note that P(k) has an exponential tail, notably 
larger than the Poisson degree distribution (Tnl) 



P(k) = e - 1 — 
V ; k\ 



(5) 



IV. OTHER STRUCTURAL CHARACTERISTICS 



(where k = 2M/N is the mean degree) of the initial random 
graph, but far from as wide as a power-law. Clearly this falls 
into one of the cases mentioned above. We note that as p 
grows the degree distribution gets closer to the original net- 
work (this was anticipated in Sect. lIIH . 



Apart from the quantities of the previous section, all di- 
rectly related to the community structure, we also look at 
some other well established structural measures: The clus- 
tering coefficient, the assortative mixing coefficient and the 
degree distribution. 



B. Clustering coefficient 

The clustering coefficient C measures the fraction of con- 
nected triples of vertices that form a triad. This type of statis- 
tics has been popular since Ref. (53). The definition we use is 
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FIG. 7 Common structural measures, (a) shows the clustering coef- 
ficient as a function of the number of vertices for the seceder model 
and rewired networks, (b) displays the corresponding plot of the as- 
sortative mixing coefficient. The network parameters are M = 3N 
and p = 0.1. Error bars are shown if they are larger than the symbol 



slightly different from that of Ref. 
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FIG. 8 Illustration of the identification of clusters at consecutive 
time steps. The vertex set is represented by the horizontal line. The 
vertical tics demarcate cluster boundaries. The communities at con- 
secutive time step is matched so that the overlap (the horizontal sum 
of shaded segments) is maximized. 



C. Assortative mixing coefficient 

The assortative mixing coefficient d38l) is the Pearson cor- 
relation coefficient of the degrees at either side of an edge: 



4<fr k 2 ) - fa + k 2 ) 2 
2{k 2 . + k 2 ) - {h + k 2 ) 2 



(7) 



(6) 



where subscript i denotes the ith argument and average is over 
the ed ge s et, r is known to be positive in many social net- 
works (38; 39) (networks of online interaction does not seem 
to follow this rule J2^) ). It has been suggested that this as- 
sortative mixing can be related to community structure (l44f> . 
Against this backdrop it is pleasing, but not surprising, to note 
that the networked seceder model produces networks with 
markedly positive r, see Fig. Gib)- The reference networks 
with the same degree sequences conver ges to zero from neg- 
ative values, as also observed in Ref. (1251) . It has been ar- 
gued 1451) that networks formed by agents without any 
preference for the degrees of the neighboring vertices gets 
negative r from the restriction that only one edge can go be- 
tween one pair of vertices. This is probably the reason for the 
negative r values of the rewired networks. 



where c(n) denotes ten number of representations of circuits 
of length n and p{n) denotes the number of representations of 
paths of length n. (By 'representation' we mean an ordered 
triple such that one vertex is adjacent to the vertex before or 
after. For example, a triangle has six representations — all per- 
mutations of the three vertices.) This definition is common in 
sociology (although sociologists emphasize triad statistics for 
directed networks) — see Ref. (30) for a review — but is also 
frequent in physicists' literature since Ref. (|7|). A plot of C 
as a function of N is shown in Fig. Et a )- We see that C for 
the seceder model converges to a constant value rather rapidly. 
Similarity the C for the rewired networks goes to zero roughly 
over the same time scale. The fact that community structure 
induces a high clustering is well known and modeled ( 401) . as 
is the fact that the clustering vanishes like 1 /N in a random 
graph with Poisson degree distribution 1411) . 



V. CHARACTERISTICS OF COMMUNITY DYNAMICS 

In this section we look at the dynamics of the communi- 
ties. To do this we need criteria for if a cluster B' k at time t is 
the same as cluster BJj 1 at time t — 1. The idea is to find the 
best possible matching of vertices between the partition into 
clusters of the two consecutive time steps. To give a mathe- 
matical definition, let B, = [B], • • ■ , Z?f w } be the partition of 
G, into clusters by the algorithm described in Sect. III. CI and 
let V — min(b(f), b{t - 1)). Now we define a mapping / from 
b' elements of [l,b(t- 1)] to V elements of [1, /?(?)] such that 
the overlap 

b' 

k=l 
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FIG. 9 Community dynamics for a typical run with the parameter values N = 100, M = 300 and p = and 100 iterations of the networked 
seceder model. The different panels shows different statistics for one single run of the algorithm, (a) shows the time evolution of the assortative 
mixing coefficient, (b) shows the clustering coefficient C. (c) shows the modularity Q. (d) shows the maximal overlap y between consecutive 
time steps, (e) illustrates the time evolution of the communities. A vertical cross section of (e) gives the respective relative sizes of the different 
clusters. The clusters are sorted horizontally according to age — the oldest clusters are in the top of the panel. 



is maximized (| • | denotes cardinality). Let y{t) denote this 
maximized y' t value. To calculate this overlap we use the 
straightforward method of testing all matchings. In principle 
this algorithm runs in exponential time, but since the number 
of groups is typically rather low systems of a few hundred 
vertices numerically tractable. 

The evolution of the group structure, with the group struc- 
ture identified as described above, is displayed in Fig. [9] In 
Figs-EJa) and (b) we see the time evolution of the assortative 
mixing coefficient r and the clustering coefficient C, whose 
average size-scaling was studied in Sect. lIVI We note that the 
assortative mixing coefficient fluctuates rather much. Even 
though it is mostly positive (remember that the average value 
is significantly positive) it also have rather pronounced nega- 
tive values. This is likely to be a finite size phenomenon — as 
the assortative mixing coefficient is self-averaging ( 25), larger 
systems would not fluctuate much and have stable positive 
values (as seen in Fig. 0b)). The clustering coefficient as dis- 
played in Fig. |5Jb) shows a more stable evolutionary trajec- 
tory. Over a time scale roughly corresponding Xo N - 100 up- 
dating steps C goes from the value of the initial Erdos-Renyi 
value to the higher clustering coefficient of the networked se- 
ceder model. This is natural since it is also roughly the time 
scale for all vertices to be picked and rewired once. The value 
of the modularity Q, displayed in Fig.|5Jc), is shows a simi- 
lar behavior as the clustering coefficient as it increases from 
the value ~ 0.4 of the original random graph to ~ 0.6 of the 
seceder model. C and Q seem to be strongly correlated, some- 
thing that seems very logical in the context of the seceder 



model — the clustering coefficient increases when a high de- 
gree vertex is rewired to a specific cluster, a process that also 
strengthens the community structure. If this strong C-Q corre- 
lation is a more ubiquitous property is an interesting problem 
for future studies. In Fig. |9jd) we plot the overlap y which 
fluctuates between 25 and 75 with an average well below 50. 
These values are lower than we expected a priori, as it means 
than identity of more than half the group members changes 
a typical time step. Just as the fluctuations in r, we expect 
the fluctuations in the cluster structure to decrease with sys- 
tem size, therefore y/N will increase with N. In Fig.^e) the 
time development of different cluster sizes is illustrated. A 
horizontal cross section gives the size partitioning of the ver- 
tex set at a given time step. A demarcated area represents a 
group. Older groups are above younger groups. An obser- 
vation from Fig. |9je) is that groups typically lives between 
one and 100 time steps. The life-time scale of groups seems 
to coincide with that of the initial relaxation to the seceder 
equilibrium. We also note that there seems to be no particular 
correlation between age and stability or size, a situation that 
would have produced skewed life time or cluster size distribu- 
tions. At the bottom of the diagram, hardly visible, there are 
numerous small, short-lived, clusters. This is an effect of iso- 
lates constantly present in the system (for this set of parameter 
values there are typically one or two at a time step). 

The observations in this section were checked for a few 
other runs and seems to be representative. Since they do not 
hint some surprising phenomena (against the backdrop of the 
previous sections and the algorithm itself) we do not conduct 
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any extensive statistical survey of the dynamical properties. 

VI. SUMMARY AND CONCLUSIONS 

We have proposed a model for network formation based 
on the seceder model. The model captures how a commu- 
nity structure can emerge from the desire to be different, both 
in social and economic systems. The community structure of 
our model is analyzed with a recent graph clustering scheme. 
This scheme has the advantage that it gives a measure of the 
degree of community structure in a network — the modularity 
Q. We see that the Q is much higher for our model networks 
than for random reference networks with the same degree dis- 
tributions. Both the number of groups and the average of size 
of groups growth as power-laws with sub-linear exponents. 
Both the average geodesic distance between vertices of the 
same and different clusters grows logarithmically; the differ- 
ence between these, however, is much larger for the networked 
seceder model than for the random reference networks. The 
general picture is thus the the networked seceder model gener- 
ates well-defined communities just like the agents of the orig- 
inal seceder model gets clustered in trait space. 

The networked seceder model gives networks of high clus- 
tering and positive assortative mixing by degree — properties 
that are known to be characteristic of acquaintance networks. 
The degree distribution has a peak around the average degree 
and a exponentially decaying, also that consistent with real 
world observations. 

The dynamics of the communities were briefly investigated 
by defining a mapping between consecutive time steps that 
maximizes an overlap function. Using this method we con- 
clude that the speed of the dynamics is set by the size of the 
system. We see that the clustering coefficient and modularity 
are strongly correlated and that older groups are not necessar- 
ily larger than younger. 

To epitomize, the networked seceder model gives an mech- 
anism of emergent community structure that is different from 
earlier proposed mechanisms in network models J27ll35ll4^) . 
The mechanism is arguably present in, at least, social net- 
works (28). We speculate that this model can be applied to 
networks of companies that are linked if they are active in the 
same market. 
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